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Abstract 

(3JQ[ In this work we explore error-correcting codes derived from the "lifting" of "affine-invariant" 

codes. Affine-invariant codes are simply linear codes whose coordinates are a vector space over 
a field and which are invariant under affine-transformations of the coordinate space. Lifting 
takes codes defined over a vector space of small dimension and lifts them to higher dimensions 
by requiring their restriction to every subspace of the original dimension to be a codeword of 
the code being lifted. While the operation is of interest on its own, this work focusses on new 
ranges of parameters that can be obtained by such codes, in the context of local correction and 
testing. In particular we present four interesting ranges of parameters that can be achieved by 
such lifts, all of which are new in the context of affinc-invariance and some may be new even in 
general. The main highlight is a construction of high-rate codes with sublinear time decoding. 
The only prior construction of such codes is due to Kopparty, Saraf and Yekhanin [53] . All our 
codes are extremely simple, being just lifts of various parity check codes (codes with one symbol 
of redundancy), and in the final case, the lift of a Reed-Solomon code. 
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1 Introduction 



In this work we explore the "locality properties" of some highly symmetric codes constructed by 
"lifting" "affine-invariant" codes. We describe these terms below. 

1.1 Basic terminology and background 

We start with some standard coding theory preliminaries. Let F^ denote the finite field of cardinality 
q and for any finite set D, let {D — > ¥ q } denote the set of all functions from D to ¥ q . In this work, 
a code on coordinate set D is a set of functions J- C {D — > ¥ q }. A code J- is said to be linear if 
it forms a vector space over ¥ q , i.e., if for every f,g G T and a G ¥ q the function af + g G J-. 
We refer to N = \D\ as the length of the code. A second parameter of interest is the dimension 
of the code which is the dimension of J 7 as a vector space. The dual of a code denoted J 7 ^, is 
the set of functions {g : D — » ¥ q \(f,g) = V/ € J 7 }, where (f,g) = YlxeD f( x )d( x ) denotes the 
standard inner product of vectors. Let wt(/) = |{x G D\f(x) ^ 0}| denote the weight of /. Let 
3(f,9) = |{^ £ D\f(x) 7^ g(x)}|/|-D| denote the (normalized Hamming) distance between / and g. 
(So 5(f,g) = wt(/ — g)/\D\.) We say / is 5-close to g if S(f,g) < 5 and <5-far otherwise. We say 
/ is 5-close to J- if there exists g G JF that is 5-close to / and <5-far otherwise. We say J 7 is a code 
of distance 5 if every pair of distinct codewords in T are <5-far from each other. We use ^(J 7 ) to 
denote the maximum 5 such that J 7 is a code of distance 5. 

In this work we explore some aspects of affine-invariant codes. In such codes the domain D is a 
vector space F^i, i.e., an m-dimensional vector space over the n-dimensional extension field of the 
range ¥ q . Let Q = q n and let ¥q denote the field of size Q. We say a function A : ¥q — > ¥q is an 
affine function if A(x) = M ■ x -\-b for some matrix M G Fg Xm and vector b G ¥q . We say A is an 
affine permutation if M is invertible. A code T C {¥q — > ¥ q } is said to be affine-invariant if for 
every affine permutation function A : ¥q — > ¥q and for every / G T the function / o A given by 
(/ o A)(x) = f(A{x)) is also in J 7 ^ 

Affine-invariant codes are of interest to us because they exhibit, under natural and almost 
necessary conditions, very good locality properties: they tend to be locally testable and locally 
correctible. We introduce these notions below. We say a code T is (k, 5)-locally correctible ((k,5)- 
LCC) if there exists a probabilistic algorithm Corr that, given x G D and oracle access to a 
function / : D —> ¥ q which is (5-close to some g G J 7 , makes at most k queries to / and outputs 
g(x) with probability at least 2/3. We say that T is (k, e, <5)-locally testable ((k, e, <5)-LTC) if J- is 
a code of distance 5 and there exists a probabilistic algorithm Test that, given oracle access to 
/ : D — > ¥ q , makes at most k queries to / and accepts / G T with probability one, while rejecting 
/ that is T-far from T with probability at least e • r. 

1.2 This work: Motivation and Results 

As noted above affine-invariant lead naturally to locally decodable codes and locally testable codes. 
In this work we use a certain lifting operation to exhibit codes with very good locality. We start 
by defining the lifting operation. For a function / : Fg — > ¥ q and set S C Fg let f\s denote the 
restriction of / to the domain S. 

In some of the earlier works invariance is defined with respect to all affine functions and not just permutations. 
In Section we show that the two notions are equivalent and so we use invariance with respect to permutations in 
this paper. 
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Definition 1.1 (Lifting). For a code J- C |Fg — > F g }, and integer m > t its m- dimensional lift 
Lift m (J") C {F^ -> FJ is the code 

{/ : Fq — > Fg | f\v £ J~ for every t- dimensional affine sub space V C Fg}. 

(Note that the definition above assumes some canonical way to equate i-dimensional subspaces 
of Fg with Fg. But for affine-invariant families T the exact correspondence does not matter as 
long as the map is an isomorphism.) 

The lift is a very natural operation on affine-invariant codes, and builds long codes from shorter 
ones. Indeed, lifts may be interpreted as the basic operation that leads to the construction of 
"(Generalized) Reed-Muller" codes, codes formed by m-variate polynomials over F q of total degree 
at most d: Such codes are the "lifts" of t-variate polynomials of degree at most d, for t = \ q d ^ j j p ~\ 
where p is the characteristic of q. (This follows from the "characterization" of polynomials as 
proven in [23].) While the locality properties (testability and correctability) of Reed-Muller codes 
are well-studied [28J El HJ [27l CEJ EH [20l UM US] , they are essentially the only rich class of symmetric 
codes that are well-studied. The only other basic class of symmetric codes that are studied seem 
to sparse ones, i.e., ones with few codewords. 

In this work we explore the lifting of codes as a means to building rich new classes of dense 
symmetric codes. (In Theorems 1 1 . 2 1 - fl . 5 1 b elow we describe some of the codes we obtain this way, 
and contrast them with known results.) Along the way we also initiate a systematic study of lifts 
of codes. Lifts of codes were introduced first in [7], who explored it to prove negative results - 
specifically, to build "symmetric LDPC codes" that are not testable. (Their definition was more 
restrictive than ours, and also somewhat less clean.) Our work is the first to explore positive use 
of lifts. 

We remark that all codes constructed by lifting have relative distance of at least Q~ l and are 
(Q*, Q-*/3)-LCC's and (Q*, n(Q~ 2t ), Q-*)-LTC's. The local correctability follows directly from 
their definition, while the local testability is a consequence of the main result of [241 Theorem 2.9]. 
(See also Proposition 12.61 ) This general feature suffices for three of our code construction, while in 
the fourth case we have to analyze the decodability a little more carefully. 

The constructions For simplicity all codes are described for the case of fields of characteristic 
two. The first three codes are described only for q = 2 while the construction does generalize 
to other fields. These codes are obtained by the lifting of the parity-check code. By making 
appropriate choices of Q and t we get codes with different locality (and distance) . The fourth code 
works over large fields only and is obtained by lifting the Reed-Solomon code. 

Our first code has constant locality k, for k being a power of 2. If the length of the code is N 
(in our setting ./V = Q m ), then the code has dimension f2fc((log N) k ). 

Theorem 1.2. For every positive integer t and k = 2 , there exists a constant c& > such that 
for every positive integer m and N = 2 m , there exists a code of length N, dimension at least 
c k (logN) k - 2 which is a (k, k" 1 /2>)-LCG, and a [k,Q,(k- 2 ),k- l )-LTC. 

To contrast this with other known codes, essentially the only symmetric binary code known 
in this regime is the Reed-Muller code, which has dimension fi((log N)^ ogk ) for locality k. Thus 
our code has significantly greater dimension in this regime. Our results are also asymptotically 
optimal for affine-invariant codes, by a result of Ben-Sasson and Sudan [9] which shows that any 
affine-invariant code with such local correctability or testability must have dimension (log N) k+0 ^\ 
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For local correctability, these codes asymptotically match the performance of best-known codes, 
which would be obtained by taking Generalized Reed-Muller codes over a field of size roughly k 
and then composing it with some binary code. Our codes are simpler to describe and the symmetry 
comes without any loss of parameters. Furthermore, for really small constants, say k = 4 or k = 8, 
these codes seem to be better than previously known locally correctible codes. 

Our next two codes consider relatively large locality (growing with N). The advantage with 
these codes is that the redundancy (the difference between the length and the dimension) grows 
exceedingly slowly. The first of these two codes considers the setting where the locality is N e for 
some positive (but tiny) e. In such cases, we get codes of dimension N — N 1 " 6 where e' > if 
e > 0. Thus the dimension is extremely close to the length. 

Theorem 1.3. For every e > there exists e' > such that for infinitely many N , there is a code 
of length N, dimension N - jV w which is a (N e ,N- e /3)-LCC and a (iV e , 0(iV- 2£ ), N~ e )-LTC. 

The codes from Theorem 11.31 are not new. These codes, and in particular their exact dimen- 
sion are well-known in the literature in combinatorics [1 H 129] , Their locality was first noted by 
Yekhanin [30] who noticed in particular that they are LCCs. Our main contribution is to note 
that these are (naturally) obtained from lifts. In the process we get that these are affine-invariant 
codes and so are also LTCs, a fact that was not known before. Finally, our bounds while cruder, 
give better asymptotic sense of the redundancy of these codes (and in particular note that the 
redundancy is sublinear in the code length). 

We remark that these codes have very poor distance and very poor error-correcting capability. 
However, in the context of applications such as constructions of PCPs (probabilistically checkable 
proofs, see e.g., [2]) one does not need distance or error-correction capability per se. All one seems 
to need is the local correction and decoding capability. So the theorem above motivates the search 
for extremely efficient PCPs, where the difference between the length of the PCP and the length 
of the classical proof is sublinear, while allowing for sublinear query complexity. Such a result, if 
at all possible, would really be transformative in the use of PCPs as a positive concept. 

Next, we consider codes of locality Q(N), so linear in the length of the code. This range 
of parameters was motivated by the recent result of Barak et al. [5] who used such codes (with 
additional properties that we are not yet able to prove) to build "small-set expanders" with many 
"large eigenvalues". We won't describe the application here, but instead turn to the parameters 
they sought. They wanted codes of length N with locality eN and dimension N — poly(log N). The 
codes they used were Reed-Muller codes. By exploring lifts we are able to suggest some alternate 
codes. These codes do have slightly better dimension, though unfortunately, the improvement is 
not asymptotically significant (and certainly not close to any known limits). Nevertheless we report 
the codes below. 

Theorem 1.4. For every e > and for infinitely many N, there is a code of length N , dimension 
N- O e ((logA0 logl/e ), which is a (eN ^{eN^-LCC and a (eN, Q((eN)~ 2 ), (eiV) _1 )-iTC. 

Finally, we describe the most interesting choice of parameters. Our final code has locality N s 
for arbitrarily small 5 > 0, while achieving dimension (1 — e)iV for arbitrarily small e > 0. While 
the dimension of this code is smaller than that of the codes of Theorem 11.41 it corrects a constant 
positive fraction of errors. 

Theorem 1.5. For every e,5 > there exists r > such that for infinitely many N, there is a 
code of length N over 

^Q, for Q » N 5 , of dimension (1 - e)N which is a (N s ,t)-LCC. 
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Till 2010, no codes achieving such a range of parameters were known. In particular no code was 
known that achieved dimension greater than N/2 while achieving o(N) locality to correct constant 
fraction of errors. In 2010, Kopparty et al. [26] introduced what they called the "multiplicity codes" 
which manage to overcome the rate 1/2 barrier. Other than their codes, no other constructions 
were known that achieved the parameters of Theorem 11.51 and our construction provides the first 
alternate. We remark that while qualitatively our theorem matches theirs, the behavior of r as a 
function of e and 5 is much worse in our construction. Nevertheless for concrete values of N, e and 
8 our construction actually seems to perform quite well. 

Theorems II. 2111. 51 are proved in Section [3l While each of the codes above may be of interest 
on their own, the underlying phenomenon, of constructing codes with interesting parameters by 
lifting shorter codes is an important one. Given our belief that lifting is an important operation 
that deserves study, we also do some systematic analysis of lifts. In particular in this work we show 
that lifting of a base code essentially preserves distance. This preservation is not exact and we give 
examples proving this fact. 

Previous work on affine-invariance The study of invariance, and in particular affine-invariance, 
in property testing was initiated by Kaufman and Sudan [23] and there have been many subsequent 
works [SI [HI HZ1 [13 [25l [22I El El HH El HI]. Most of the works, with the exceptions of [251 [22], 
study the broad class with the aim of characterizing all the testable properties. The exceptions, 
Kaufman and Wigderson [25] and Kaufman and Lubotzky [22], are the few that attempt to find 
new codes using invariance. While the performance of their codes is very good, unfortunately they 
do not seem to lead to local testability and the performance is too good to be locally decodable 
(or locally correctible). Our work seems to be the first in this context to explore new codes that 
do guarantee some locality properties. 

A second, more technical, point of departure is that our work refocusses attention on invariance 
of "multivariate properties" . Since the work of [23] most subsequent works focussed on univariate 
properties. While this study seemed to be without loss of generality, for the purpose of constructions 
it seems necessary to go back to the multivariate setting. One specific contribution in this direction 
is that we show that invariance under general affine-transformations and under affine-permutations 
lead to the same set of properties (see Section lAl). 

Organization. In Section[2]we present some of the background material on affine-invariant codes 
and present some extensions in the multivariate setting. In Section [3] we describe our codes and 
analyze them. In Section [4] we describe how distance of lifted codes behave. Some of the technical 
proofs are deferred to the appendix. 

2 Preliminaries 

In this section we describe some basic aspects of affine-invariant properties, specifically their degree 
sets. We mention in particular the fact that the size of degree sets determines the dimension of a 
given affine-invariant code. Finally we conclude by relating the degree set of a base code to the 
degree set of a lifted code. In later sections we will use this relationship to lower bound the size of 
the degree set of lifted codes, and thus lower bound their dimension. 

For a function / : ¥'q — > ¥ q , we associate with it the unique polynomial in Fq[xi, . . . ,x m ] of 
degree at most Q — 1 in each variable that evaluates to /. (We abuse notation by using the same 
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notation to refer to a function and the associated polynomial.) For d = (d\, . . . ,d m ) and x = 
(xi, . . . , x m ), let x d denote the monomial YYiLi x t' • For a function / = ^ d CdX d , let its support, 
denoted supp(/), be the set of degrees with non-zero coefficients in /, i.e., supp(/) = {d | ^ 0}. 

Definition 2.1 (Degree set). For a code T C {Fq — > ¥ q }, its degree set, denoted Deg(J-") ; is the 
set Degp 7 ) = U/ 6 jrsupp(/). For a set D C {0, . . . , Q — l} m , let its code, denoted Fam(^) be the 
set Fam{T) = {/ : Fq ->■ ¥ q | supp(/) C D}. 

For an affine-invariant code, its degree set uniquely determines the code and in particular the 
following proposition holds. 

Proposition 2.2. For linear affine-invariant codes T C {Fq — > ¥ q }, we have Fam(Deg(J 7 )) = T . 

We prove the proposition below. The proof uses some basic facts about linear affine-invariant 
codes that are proved in Section [Aj (We note that this would be the logical place to read/verify 
the contents.) 

Proof. Trivially T C Fam(Deg(J 7 )). For the other direction, consider / G Fam(Deg(J-")). Express 
/ = Trog (see, e.g., Proof of Lemma IA.7|) . where g G Fq[x] is chosen among all such to be 
minimal in its support. We have supp(g) C supp(/) C Degp 7 ). Suppose g = SdeDcg(j") c d xd ; 
then by Lemma IA.7I we have Tr(cdX d ) € T for every d. Now by linearity of J- it follows that 
^ d Tr(cdX d ) G J 7 , but by the linearity of the Trace function we have that this function is /. □ 

Our reason to study the degree sets is that the size of the degree set gives the dimension of a 
code exactly. 

Proposition 2.3. For a linear affine-invariant code J- C {Fq — > ¥ q }, we have the dimension of 
T equals \ Deg^)) . 

Proof We generalize the proof of [6l Lemma 2.14] to the multivariate setting. For degree d G 
{0, . . . ,Q-l} m , define 5(d) = {g*d | i G Z}. For every d,e, either 5(d) = 5(e) or 5(d)n5(e) = 0. 
Write / G T as /(x) = X^deDcg^) /d xd - Since f q = f, it follows that / g .d = /d for all d and hence 
fd £ F |s( q )|. From each 5(d) pick a representative, and let 5 be the set of these representatives, so 
that Deg^) = Udes5(d) is a partition. Then we may write /(x) = X^de5^ rF is(d)i ,F«j(/d xd )- For 

each d G 5 there are gl s ( d )l choices for /d, so the total number of choices for / is rides'?' 5 ^' = 
g Edesl5(d)| =q \Deg(F)\ D 

Next we attempt to describe how the degree set of a lifted code can be determined from the 
degree set of a base code. 

It turns out that the structure of the degree set (not every set D is the degree set of an affine- 
invariant code) is strongly influenced by the base p representation of its members. We start with 
some notions related to such representations. For non- negative integers a and b, let a^°\ . . . , 
and b(°\ b^\ . . . , be their base p expansion, i.e., < < p, a = ^ a^p 1 and b = b^p % . 

We say a is in the p-shadow of b, denoted a < p b, if < few for every i. We extend the notation 
above to vectors and matrices if the relationship holds coordinatewise, so for A, B G Z nx ^ A < p B 
if (A)ij < p (B)ij for every pair (i,j) G [n] x [£]. 

Finally, we extend the notion to compare vectors to integers (and matrices to vectors). For 
e G Z and d G Z we say e < p d if for every subset 5 C [£] and for every f < p e we have 
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^2i£S fa —p d- (This notion corresponds to the support of (1 + ^«=i x i) d '- x e appears with a non- 
zero coefficient only if e < p d.) Extending to matrices and vectors, A G Z nx ^ with rows (A)j G Z 
and d = (di, . . . , cZ n ) 6 Z n we say A < p d if (A)j < p dj for every j G [n]. 

Let (mod* Q) denote the operation that maps non-negative integers to the set {0, . . . , Q — 1} 
as given by a (mod* Q) = if a = and a (mod* Q) = b G {1, . . . , Q — 1} if a ^ and a = b 
(mod Q — 1). (Note that if a (mod* Q) = 6, then x a = rr b (mod — x).) 

Finally, we need one more piece of notation before defining the degree sets of lifts. For matrix 
A G Z nxe , let S(A) G 7L l denote its row sum given by S(A)j = E"=i( A ) 



u ■ 



Definition 2.4 (Degree sets of lifts). For a set D <^ {0, . . . , Q — l} 1 , its rath lift, denoted Lift m (D), 
is given by 

{d G {0, ... ,Q — l} m | V E G Z mxt < p d, we have E(E) (mod* Q) G £>}. 

The following proposition makes the implied connection between lifts of codes and their degree 
sets explicit. We note that the special case where t = 1 was implicitly noted in J7j. 

Proposition 2.5. For every linear affine-invariant code T C {Fq — > F g }, and for every ra > £ ; 
we Ziaue Lift m (Deg(J 7 )) = Deg(Lift m (J 7 )). 

Proof. Before we begin the proof, we first define some notation. For a matrix E, let E T denote 
the transpose of E, so that (E)jj = (E T )jj. For an integer d and vector e = (ei,...,ej), define 
( d ) = — i r , . d ' rr, the standard multinomial coefficient and also the coefficient of x e in 

Ve/ ei!---et!(d— ei e n )! ' 

the expansion of (1 + X^i=i a; «) a! - Extending this notation, for a vector d = (di,...,d m ) and a 
matrix E G Z mxt with rows ei,. . . ,e m , define ( d ) = UT=i ©■ We wil1 use the fact that (e) ^ 
(mod p) if and only if E < p d (see Lemma |B.2|) . Finally, for two matrices A, E G Z mx *, define 
A E = Yii j . For convenience, let F = ¥ q and let K. = Fq. 

Now, we begin the proof. There exist Uj G K*, G F such that / G T <J=^> \/i ^ \jf(t%j) = 0. 
The assertion then follows from the following equivalences: 

d G Deg(Liftm) Lemmata VA G K Tr(Ax d ) G LiftfJ 7 ) 

VAGKVAeK mxt VbeK m Tr(A(Ax + b) d ) G J" 
VA, A, b Vi ^ Xij Tr(A(At.y + b) d ) = 



j 



Lcmma«[El VA> A> b Tr ( A ]T (3) A E b d ~ E ^ T > £ E W 

\ E< p d ^ ' j t 

VA, bVi ^ (f) A E b d - s ( ET ) ^ Ayt^W = 

-(?.<- A \ / 



VE < p d Vi J3 *y *y E(E) = 
j 

VE < p d E(E) (mod* Q) G Deg(J") 



□ 
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The definition of Lift m (D) is somewhat cumbersome and not easy to work with. However in 
the upcoming sections we will try to gain some combinatorial insights about it to derive bounds on 
the dimension of the codes of interest. 

Finally, before concluding we mention explicitly the locality properties of lifted codes. 

Proposition 2.6. Let T C {Fq — > ¥ q } be a linear affine-invariant code. Let C = Lift m (J 7 ) be its 
m-ary lift. Then Lisa (Q*, \Q~ l )-LCC and a (Q*, n(Q~ 2t ), Q-*)-LTC. 

Proof. Given / : Fq — > ¥ q that is Q~*/3-close to p £ £ and a G Fq, the local decoding algorithm 
works as follows: Pick random linearly independent bi, . . . bt G Fq and let h : Fq — > ¥ q be given by 
h(ui, . . . ,u t ) = /(a + itibi + • • • + u t h t ). Compute g G T such that 5(h,g) < Q~ l and output g(0). 

We show that the decoder succeeds with high probability. Let A be the i-dimensional subspace 

A = {a + Uibi H + u t h t | u G Fq}. By Markov's inequality, Pr bl) .., )b( [5(}i,p\a) > Q~ l ] < 1/3. 

So, with probability at least 2/3, <5(<7,pU) < 6(g, h) + p| ^4.) < 2Q~* and hence ^ = p\a, so the 
decoder outputs g(0) = p\a(0) = p( a ) as desired. 

The local testability follows directly from \24\ Theorem 2.9]. □ 

3 Constructions 

3.1 Codes of constant locality 

In this section we prove Theorem 11.21 which promised binary codes of locality k and length N with 
dimension ilfe(log N) k ~ 2 . 

The Code: Fix k = Q = 2 e and N = 2 mi . Let T\ C {Fq -)■ F 2 } be the code given by 
{/ : Fq — > ¥2 I ^2 ae ^ Q f{ot) = 0}. Let C\ = Lift m (.Fi). In what follows we verify that C\ has the 
properties claim in Theorem II .21 

We start with some obvious aspects. 

Proposition 3.1. C\ is a code of length N and a (k, k -1 /3)-LCC and a (k,Q(k~ 2 ),k~ 1 )-LTC. 

Proof. The length is immediate from the construction. The local correct ability and testability 
follow from Proposition 12.61 □ 

The main aspect to be verified is the dimension of C\. We first describe the degree set of T\. 

Claim 3.2. Deg(Ji) = {0, . . . , Q - 2}. 

Proof. Write / : Fq -> F 2 as /(*) = £?=o f d x*. Then E aS F Q /(«) = E a&Q E2="o ft** = 
Yld=o fd (j2 a &¥ Q ad ) = _ /q-i where we have used the fact that Y^ a & Q a d = -lii d = Q -I and 
is equal to 1 otherwise. Therefore / G T\ if and only if deg(/) < Q — 1. □ 

The next claim interprets the definition of Lift m (-D) in our setting. 

Claim 3.3. d G {0, . . . , Q — l} m is contained in Lift, n (Deg(J-"i)) if and only if for every e < 2 d we 
have YliLi e i ( m od* Q) ^ Q — 1. 

Proof. Follows immediately by applying Proposition 12.51 to Claim 13.21 □ 
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Given the claim, it is simple to get a lower bound on the dimension of our code. 
Lemma 3.4. The dimension of C\ is at least (q™ 2 ) ■ 

Proof. For S C [m] let dg denote the vector that is one on coordinates from S and zero outside. It 
is clear that for \S\ < Q — 2, ds G Lift m (Deg(J 7 i)) and there are at least (q™ 2 ) sucn se * s - ^ 

Proof of Theorem \1.2l Theorem 11.21 follows Proposition 13.11 and Lemma f3.4l and plugging the values 
of to and Q from the construction. Specificalle we have that the dimension of the code is at least 
(q-J - F^l( 1 °S iV ) fc ~ 2 - So the theorem follows for c k = p-^y. □ 

3.2 Codes of sublinear locality 

Next we turn to Theorem 11.31 which asserts the existence of codes of locality N € with dimension 
N-N 1 -''. 

The Code: Fix e > and let m = \l/e\. Let £ be an integer such that 2 mi > N. Let Q = 2 e . 
Let F 2 C {F Q -> F 2 } be the code {/ : F Q -> F 2 | >J n . V() /(a) = 0}. Let £ 2 = Lift m (J- 2 ). 
As usual we get the following proposition. 

Proposition 3.5. Ti is a code of length at least N and locality at most N e . Specifically it is a 
(N e ,fl{N~ e ))-LCC and a (N e , tl(N~ 2t ), N~ e )-LTC. 

We now turn to the task of analyzing the dimension of this code. We first describe the degree 
sets of J-" 2 and £ 2 . 

Claim 3.6. Deg(T 2 ) = {0, . . . , Q - 2} and 

Deg(£ 2 ) = {de{0,...,Q - l} m | Ve < 2 d, ^ e, (mod* Q) £ Q - 1}. 

i 

Proof. The first part follows from the proof of Claim 13.21 The second part follows immediately 
from Proposition 12.51 □ 

Lemma 3.7. The dimension o/£ 2 is at least N — N 1 " 6 ' for some e' = S7(2 — 2 / e ) . 

Proof. Let D = Deg(.F 2 )- Let e = (e i , . . . , e m ) and , , . . . , 1 denote the binary expansion 
of ej. 

Claim 3.8. If there exists integer s £ {0, ...,£— 1} such that for every i £ [m] and every j £ 
[1 + |~logm~|] we have e s ^ ( mod £ ) = q ; then e £ Lift m (D). 

Proof. We first note that e E Lift m (D) if and only if its binary rotation e', given by e' = 
(2ei (mod* Q), . . . , 2e m (mod*Q)), belongs to Lift m (D). Thus without loss of generality we can 
assume (by shifting e appropriately), that the block of zeroes are the most significant digits in the 
ej's. (i.e., s = m — [log to] — 2.) With this assumption, we now have < 2^ _1 ° sm_1 < Q/(2m) < 
(Q — 1)/to. We thus conclude that for every f < 2 e, YlT=\fi — YlT=i e i < Q — 1 and so (by 
Claim ESD e G Lift m (L>). □ 
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The lemma follows by an easy counting argument. Let t = 1 + [log m] . We partition the set [£] 
into i/t blocks of t successive integers each. For each such block the number of possible assignments 
of bits that do not make the entire block zero in each is 2 mt — 1 . Thus the total number of vectors 
e that do not have any of these blocks set to zero is (2 mt - 1)'/* = 2 m£ {l - 2" m *)^* « 2 me e e ^ t2m ^ = 
2 me(i-n(i/mt2^)) = N i-e' for £ / = i/( mt2 m *). Recalling that e = 1/m, we have e' = Q(2~ 2 / e ). 
The lemma follows by noting that if e Lift m (D) then in each of these blocks it must be non-zero 
somewhere (by Claim [3781 above) . □ 

Proof of Theorem \l.'A Theorem 11.31 follows immediately from Proposition 13. 51 and Lemma [3. 71 □ 



3.3 Codes of linear locality 

Finally, we prove Theorem 11.41 which claims codes of locality eN with dimension N — poly log N. 
This construction is different from the previous two in that here we lift a multivariate code, whereas 
in both previous constructions we lifted univariate codes. 

The Code: Let I = [logl/e] (so that 2~ e < e). Let Q = 2 e . For integer m let N = 2 nd and 
let t = m-l. Let JF 3 C {F* Q -> F 2 } be given by T 3 = {/ : F^ -> F 2 | J2 a&t f(a) = 0}. Let 
£ 3 = Lift m (J3). 

Proposition 3.9. £3 is a code of block length N with locality eN. Specifically, it is a (eN, ^(eiV)^ 1 )- 
LCC and a (eN, (eN)-2, {eN)~ l )-LTC. 

The proposition below asserts that every degree except the vector that is Q — 1 in every coor- 
dinate is in the degree set of F3. (Here (Q — 1)* denotes the t-tuple all of whose entries is Q — 1, 
rather than (Q — 1) exponentiated to the t-th power). 

Proposition 3.10. Deg(J" 3 ) = {0, . . . , Q - 1}* - {(Q - 1)*}. 

Proof Write / : F* Q -> F 2 as /(x) = Ed 6 {o,...,Q-i}* /d* d Then £ QeFb f(a) = E a eF h Ed / d x d = 

Ed fd (EaeF^ « d ) = Ed fd rii=l (EaeFg = (-^/(Q-iy where we have used the fact that 

EaeF a<i = — 1 if cZ = Q — 1 and is equal to otherwise. Therefore / S J-3 if and only if 

f( Q -iy = 0. □ 

While in general degree sets of lifts of multivariate families are not easy to characterize, in this 
particular case we have a clean characterization of the degree set. 

Given e = (ei, . . . , e m ) let e>p denote the jth bit in the binary expansion of e^. Let M(e) denote 
the m x i matrix with entries M(e)jj = ef\ 

Lemma 3.11. e G Lift m (Deg(J r 3)) if and only if there exists a column in M(e) with at least two 
zeroes. 

Proof. As in the proof of Lemma [3.71 we have that e £ Lift m (Deg(J 7 3)) if and only if 2e (mod* Q) £ 
Lift m (Deg(J 7 3)). So without loss of generality we can assume that e is shifted so that the two zeroes 
are in the most significant bits. Thus we have that m — 2 of the e^'s, say e\, . . . , e m _ 2 , are at most 
Q — 1 and the remaining two are at most Q/2 — 1. We thus have that Ei^i e « < ( m — ~ 1- 
Using this and applying Proposition 12.51 it is easy to verify that e is not in Lift m (Deg(J r 3)). □ 
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The following lemma now follows by simple counting. 
Lemma 3.12. The dimension of £3 is 2 m£ — (m + 1) . 

Proof of Theorem \l-4\ Follows by plugging in the values for the parameters, specifically by setting 
£ = log 1/e and m = (log N/ log 1/e). We get that the dimension of £3 is N— (1+log Nf log l/e) logl / e . 

□ 

We remark that the construction in [5] is very close in parameters. In their construction (i.e., 
the Reed-Muller codes) the matrix M(e) must have at least I + 1 zeroes. Since any such matrix 
must have two zeroes in a single column it follows that every matrix their construction admits is 
also admissible in ours, while our allow for other matrices also. However the difference between the 
length and dimension is at most a constant factor (depending on £) . (More precisely, the dimension 
of their code is 2 mi — Yli=o ("lO ~ — (em)'.) Of course, for their application the code needs to 
have much better local testability than given here. But the local testability given here is just what 
follows immediately from the definition and previous works, and it is quite possible that better 
bounds can be achieved by more careful examination of this code. 

3.4 High-rate high-error LCCs 

Finally, we prove Theorem 11.51 This construction is a departure from the others in that the code is 
not binary, and the code being lifted is not the parity check code. Finally the decoding algorithm 
is a bit more complex to explain, though even this algorithm is by now folklore. 

The Code: Recall that we are given 5, e and some Nq and we wish a code of length N > Nq 
of dimension (1 — e)N and locality N s . Let m = \l/S~\ and s be such that Q = 2 s > Nq. 
Let b = 1 + [logm] and c = \b2 bm log 1/e] . Let 7 = 2~ c and r = 7/6 (so that 6r < 7 < 
e -(i+riogml)2™( 1 +no g mi) ^ ^ d = Q_ 2 - C )q Let jc 4 = {f . ¥q ¥q 1 deg (/) < d}. Let 

£4 = Lift m (J-4). 

Decoding: The general idea for decoding £4 is the same as that for multivariate polynomials, 
and in particular the algorithm from Gemmell et al. |13j . 

Given / : Fq —> Fq that is r-close to p E £4 and a S Fq, the decoding algorithm works as 
follows: Pick a random b £ Fq and let h : Fq — > Fq be given by h(t) = /(a + ib). Compute, using 
a Reed-Solomon decoder (see, for instance, [HI Appendix]), a polynomial g G F<g[i] of degree at 
most d such that 5(h,g) < 7/2. Output g(0). 

Lemma 3.13. £4 is a code of block length N with locality N s . Specifically, it is a 

Proof. Let L = {a + th \ t € Fq — {0}} be the line through a with slope b. We first claim that 
with probability at least 2/3, the line L contains fewer that 7/2 fraction errors (i.e., points t 7^ 
such h{t)^p\ L {t)). 

Claim 3.14. For every a, ~Pi\ 3 [5{h,p\i) > 7/2] < 2t/j. 

The above claim follows easily from an application of Markov's inequality. Next we note that 
if the fraction of errors on L is less than 7/2 then the decoder satisfies g = p\l an d so outputs 
g(0) =p|l(0) = p(a) as desired. □ 
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Next we turn to the analysis of the dimension of £4 which is similar to the analysis of £2 • First 
we note the obvious fact. 



Proposition 3.15. Deg(J r 4) = {0, . . . ,d} and 

( m 

Deg(£ 4 ) = dG{0,...,Q-l} m | V e < 2 d,^e; (mod* Q) G {(),... ,d} 



I. i=l ) 

Lemma 3.16. The dimension of £4 is at least (1 — e)iV. 

Proof. For non-negative integer b, let b^ denote its binary expansion so that b = Y^jb^. Recall 
d = (1 — 2~ C )Q. Letting d^') denoting its binary expansion, we note an integer e G {0, . . . , Q — 1} 
is at most (i if (and only if) one of the bits e^ s ~ c \ . . . , e^" 1 ) is zero. We use this to reason about 
Deg(£ 4 ). 

Let d = (d±, . . . , d m ) and let dp denote the jth bit in the binary expansion of dj. 

Claim 3.17. Let 6=1 + [log m] . // there exists j G {s — c, . . . , s — b} such that for every i G [m] 
and every t G {0, . . . , b — 1} we have d^ +£ ^ = 0, then d G Deg(£4). 

Proof. Let e = (ei,...,e m ) < 2 d and let e = YT=i e i (mod*Q). We claim that e^' +fe " 1 ) = 0, 
which suffices to show that e < d. Let = 2 s ~^ +b )ej (mod* Q) for all i G [m], and let e = 
YliLi &i (mod*Q). For every i G [m] and every k G [s], = e^ fc+s ^ +6 ^ (mods)) similarly 
e (k) = -(k+s-(j+b) (mod «))_ Therefore it suffices to show that et a_1 ) = or equivalently e < 2 s - 1 . By 
our assumption on d, e^ +i ^ = for all I G {0, . . . , b — 1} , so = for all k G {s — s — 1} and 

thus e./ < 2 S ~ 6 for all i G [m]. By our choice of 6, m < 2 b " 1 , and thus YT=i g i < m2 s " 6 < 2 s " 1 . □ 

We now consider picking d at random. By partitioning the c most significant bits into disjoint 
blocks of b bits each, we get that any such block is all zero with probability at least 2~ mb . Thus the 
probability there exists a block which is all zero is at least 1 — (1 — 2~ mb ) c / b > 1 — e~ c ^ b2m '. By 
choice of c we have that c/(b2 mb ) > ln(l/e) and so e _c /( fe2mi> ) < e an d thus the dimension is lower 
bounded by (1 - e)N. □ 

Proof of Theorem \1.5[ Follows immediately from Lemmas 13.131 and 13.161 □ 

We remark that the construction of this section is somewhat contrary to folk belief, which tends 
to suggest that generalized Reed-Muller codes (evaluations of m-variate polynomials of degree at 
most d) are equivalently defined by requiring that their restriction to lines are Reed-Solomon code- 
words (evaluations of univariate degree d polynomials). As pointed out earlier this folk statement is 
true only with some restrictions on d and Q, and our construction benefits by violating the restric- 
tions. While the fact that there exist functions that are not degree d polynomials, for d > Q — Q/p, 
which are degree d polynomials on every line has been known for a while [12], presumably it was 
suspected that the effect on the dimension of the lifted family was negligible. Fortunately for this 
work, this presumption turned out to be false. 

We also give below an example of some concrete setting of parameters for which this construction 
works. 

Example 3.18. For every iV = 2 2n , for n > 7, there exists a code of length iV over the alphabet 
F2« of dimension .77N that is decodable from 0.26% fraction errors with y/N queries 
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The example is obtained by setting c = 6, m = 2 and Q = 2™ in the construction. The fraction 
of errors is 2~ 6 /6 ~ 0.26%. The rate follows from the following claim. 

Claim 3.19. The dimension of the code is at least ((4 C - (5/4)3 c + l/4)/4 c )iV. 

While the error-correction rate of the code is smaller than that in [26], it does seem to start 
working at much smaller lengths and with much smaller alphabet sizes. 

4 General investigation of lifting 

The codes of the previous section simply picked some basic codes and lifted them to derive long 
codes of reasonable distance and interesting local testability and decodability. To go beyond this 
setting, we feel it is important to pick basic codes of (very) good distance and then lift them, and 
this could improve the performance of such codes. As may be observed from the previous section 
most of the work needed to analyze lifted codes is devoted to determining their dimension, and this 
can be a function of the exact code chosen. Features such as distance, decodability, and testability 
seem to follow more generically. In this section, we examine the simplest of these properties, namely 
the distance of the lifted code and prove some basic facts. 

Theorem 4.1. Let T C {Fq — > F q } and C = Lift m (J r ) for some m > t. We have the following: 

1. 5(C) < 5(T). 

2. 5(C) > 6{T) - Q-*. 

3. IfQe {2,3} and S(T) > then 5(C) > 5(T). 
4.1 Proof of Theorem 14.11 

We divide the proof of Theorem 14. II into several parts. We start by proving that distance does not 
increase under lifting (Theorem 14.11 Part [I]). 

Lemma 4.2. Let T C {Fq — > F q } be a linear affine-invariant code with lift C = Lift^J 7 ). Then 
5(C) <5(T). 

Proof. By induction, it suffices to show the assertion for the case m = t + 1. Let / 6 J and let 

5 = 5(f,0). Let x = (xi, . . . , xt) ■ Now consider the function g(x,y) = /(x). Clearly we have 
5(g, 0) = 5. We claim that g G C, which completes the proof. To do so we will show that g\n G J 7 
for every i-dimensional affine subspace H C Fq. Fix such a subspace H and let A : Fq — > Fq be 
an affine map whose image is H (such a map does exist). Note that <?|ir(z) = f(A(z)±, . . . , A(z) t ). 
Thus if we let A' : Fq — > Fq be the affine map given by the projection of A to its first t coordinates, 
we have that g\n = f ° A' . By Theorem lA.il / o A' G J- and so we have g G J- as claimed. (Note 
that we need to use Theorem IA.1I since A' need not be an affine permutation but it is an affine 
transformation.) □ 

Next we prove Part [3] of Theorem 14.11 which asserts that the distance of non-trivial binary codes 
does not decrease with lifting. 

Lemma 4.3. // T C {F2 ->■ F 2 } has distance 5(7^) > ^, then <5(Lift m (J")) > 5(7^) for all m > t. 
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We prove the above lemma by stating and proving the following stronger lemma first. 

Lemma 4.4. For allm>2,ifS> -^k=T and f : F^ 1 — > F 2 such that < Pr xeV m,[f (x) 7^ 0] < 8, then 
there exists an (m — 1)- dimensional affine subspace H C F™ such that < Pr xe ^[/(x) 7^ 0] < 5. 

Proof. We proceed by induction on to. The base case to = 2 is straightforward to verify. 

Now suppose to > 2 and our assertion holds for to — 1. Let Hq, Hi be the affine subspaces given 
by x m = and x m = 1 respectively. Let So, 8% denote 8(f\n , 0), 5(f\n 1 , 0) respectively. Note that 
5 > 5(f, 0) = (<5o + <5i)/2. If both <5o, o~i > 0, then by averaging we have < 5i < 5 and so H = Hi 
does the job. Otherwise, suppose w.l.o.g. that 5\ = 0. Note that < 5q < 25 and 25 > 2m _> 2 ■ Thus, 
by the induction hypothesis, there exists an (to — 2)-dimensional affine subspace H' C Hq such that 
< 5(f\ H ',0) < 25. Let H[ = {(ai, . . . ,a m _i, 1) G FJf | (ai, . . . ,a m _i,0) G Fqj be tne translate 
of //q in H\, and note that , 0) = 0. Let H = H Q U H[. Then H is an (m — l)-dimensional 

subspace of F^ such that < 5{f\ H ,0) = {5{f\ H ^,0) + <5(/|^,0))/2 < 5. □ 



Proof of Lemma \4-3[ We prove the lemma by induction on m — t. Indeed the inductive step is 



straightforward since Lift. m (J-") = Lift m (Lift m _i(J r )) and by induction both lifts on the RHS have 
smaller value of m — t and so the distance does not reduce in either step. The main case is thus 
the base case with m = t + 1. 

Suppose / G Lift m (J r ) C {F™ -»• F 2 } such that < 5(f, 0) < 5(T). By Lemma WM there exists 
an (to — l)-dimensional affine subspace H C F™ such that < 5(f\n,0) < 5(f, 0) < 5, contradicting 
the fact that f\fj G T . □ 

A similar approach works for q = 3, thus we have the following. 

Lemma 4.5. If T Q {F 3 -»■ F 3 } /ias distance ^(J 7 ) > <p- ; i/ieri ^(Lift^J 7 )) > ^(J 7 ) /or a// m>t. 

Again, we prove this by stating and proving the following analogue of Lemma l4.4i 

Lemma 4.6. For all to > 2, if f : F™ — > F3 such that 5(f,0) > , then there exists an 
(to — 1)- dimensional affine subspace H C F™ such that < <5(/|#,0) < 5(f,0). 

Proof. Let J = 5(f,0). We proceed by induction on to. For the base case to = 2, <5 > |. Suppose 
/ = f{x,y) and consider /|j,=j for i G F3. If /| y =i is not identically zero for all i G F3, then by 
averaging there is some i G F3 for which < Pi x£ r 3 [f(x, i) 7^ 0] < 5. Otherwise, w.l.o.g. suppose 
f\ y =2 = 0. Further, w.l.o.g. suppose f\y=o ^ and /(0, 0) 7^ 0. Now, if <5 > |, then the line 
H = {(x, y) G F§ I x = 0} does the job, since < P% eF3 [/(0, y) / 0] < | < 5. If 5 < |, then there 
must exist some a, b G F3 and c G {0, 1} such that f(a,c) 7^ and f(b, 1 — c) = 0. Then the line 
H = {(a, c), (6, 1 — c), (26 — a, 2)} does the job, since < P?( x ,y)£H[f( x i V) / 0] = 3 < 5. 

Now suppose to > 2 and the assertion holds for m — 1. For i G F3, let Hi be the hyperplane 
cut out by x m = i and let 5i = 5(f\Hi,0)- Then 81 + 02 + £3 = 35. If <5j > for all i G F3, then 
by simple averaging for some i G F3 we have < 5i < 5, so assume w.l.o.g. 52 = and #0 > <5i. 
First suppose 5q > 3 J_ 2 . Then, by the inductive hypothesis, there exists an (to — 2)-dimensional 
affine subspace H C Hq such that < 5(f\n,0) < <5i- Let flw be defined by the linear equations 
YliLi a i x i ~ a = an d x m = for some (oq, . . . , a m ) G F™ 4 " 1 . For each £, j G F3, let + j C Hi 
denote the affine subspace defined by YliLi a i x i ~ a o = j and x m = i. By averaging, for some 
i G F 3 , 5(f\ H(1)+i ,0) < 8 2 . Take H = U (tfW + i) U (^ 2 ) + 2t). Then < 5(f\ H ,0) < 5. 
Otherwise, suppose > 8q, so <5q,<5i < „ m 2 _ t . There exists H^ C -ffo be an (to — 2)-dimensional 



14 




forward since Lift m (J r ) = Lift m (Lift. m _i(J-")) and by induction both lifts on the RHS have smaller 
value of m — t and so the distance does not reduce in either step. The main case is thus the base 
case with m = t + 1. 

Suppose / G Lift m (J r ) C {Ff -»• F 3 } such that < <J(/,0) < ^(J 7 ). If 5(f,0) > p^r, 
then, by Lemma 14.61 there exists an (m — l)-dimensional affine subspace H C F™ such that 
< 5(f\ H ,0) < S(f,0) < 5(F), contradicting the fact that f\ H G F. If <5(/,0) < ^rr, then 
there are at most two points a,b G F™ such that f(a),f(b) are nonzero. Let i G [to] such that 
a, ^ fej and let be the hyperplane defined by Xj = Oj. Then /|# is nonzero only on a, so 
< S(f\jj) = 3 m-i < < ^(^ r )) again contradicting the fact that f\u G J 7 . □ 

For general (? > 3, we have the following. 

Lemma 4.7. If T Q {Fq — >■ F g } /ias distance 5 (J 7 ) = 5, then 5(Lift m (J")) > 5 - ^jr^y- 

Proof. Fix a non-zero / G Lift. m (J r ) and let r = <5(/, 0). Fix a G Fq such that f(a) / 0. Now 
let A be a t-dimensional affine subspace containing a chosen uniformly at random from all such 
subspaces. Let X(A) = \{x G A \ f(x) ^ 0}| be the random variable denoting the number of 
non-zero points of / on A. Since A samples every point of Fq — {a} uniformly, we have 



Therefore there must exist a i-dimensional subspace A containing a with X(A) < r(Q t — 1) + 1. 
Since f\A is a non-zero function in J 7 , we have r(Q t — 1) + 1 > 5Q l and thus we conclude that 
t > 5 — tttzt- In other words every non-zero function in T is non-zero on 5 — Xr\ fraction of the 



Finally we mention examples which show that, in some senses the gaps in Theorem 14. 1( 
Parts [2] and [3] are inherent. 

First note that if F = {Fq ->■ FJ then Lift m (J r ) = {F^ -> ¥ q } whose distance is Q~ m , and 
so some loss in the distance is inherent in Part [2] of Theorem 14.11 However, one could hope that if 
F C {Fq — > ¥g} then its distance is preserved by lifting (as in Par t [3l of Theorem 14 . 1 p . Unfortunately 
(actually fortunately, since this is where the rate improvement of codes in Theorem 1 1 . 21 comes from) 
even this hope is not true. If one takes F to be the binary code with degree set being all weight one 
integers, then its lift contains all the weight one integers as well as some integers of weight greater 
than one. The code consisting of only weight one integers in its degree set has distance exactly 1 /2 
while codes that have rate greater than these must have distance strictly smaller than 1/2 (by the 
Plotkin bound). This suggests that distances can reduce under lifts. A search reveals that the code 
F C {F4 — > F2} with degree set T)eg(F) = {0, 1,2} has distance 1/2 while its lift C = Lift^-F) has 
distance 3/8. 



E A [X(A)} = 1 + 



TQ m - 1 
Q m - 1 



(Q*-l) <l + r(Q*-l). 



points, as asserted. 



□ 
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A Equivalence of invariance under affine transformations and per- 
mutations 

In their work initiating the study of affine-invariant properties (codes) , Kaufman and Sudan [23] 
studied properties closed under general affine transformations and not just permutations. While 
affine transformations are nicer to work with when available, they are not mathematical elegant 
(they don't form a group under composition). Furthermore in the case of codes they also do not 
preserve the code - they only show that every codeword stays in the code after the transformation. 
Among other negative features affine transformations do not even preserve the weight of non-zero 
codewords, which can lead to some rude surprises. Here we patch the gap by showing that families 
closed under affine permutations are also closed under affine transformations. So one can assume 
the latter, without restricting the class of properties under consideration. We note that such a 
statement was proved in [6] for the case of univariate functions. Unfortunately their proof does not 
extend to the multivariate setting and forces us to rework many steps from |24j . 

Theorem A.l. If JF C {Fq — > ¥ q } is an ¥ q -linear code invariant under affine permutations, then 
T is invariant under all affine transformations. 

The central lemma (Lemma IA.2[) that we prove is that every non-trivial function can be split 
into more basic ones. This leads to a proof of Theorem IA. II fairly easily. 

We first start with the notion of a basic function. For Q = q n , let Tr : Fq — > ¥ q denote the trace 
function Tr(x) = x + x q + • • •+x qn 1 . We say that / : Fq — > ¥ q is a basic function if /(x) = Tr(Ax d ) 
for some d € {0, . . . , Q — l} m . For T C {Fq — > ¥ q } and / 6 J- we say / can be split (in J 7 ) if there 
exist functions g and h such that f = g + h and supp(g), supp(/i) C supp(/). 

Lemma A. 2. If J- C {Fq — > ¥ q } is an ¥ q -linear code invariant under affine permutations, then 
for every function f £ T , f is either basic or f can be split. 

We first prove Theorem I A . 1 1 from Lemma lA.21 

Proof of Theorem \A.li First we assert that it suffices to prove that for every function / G JF the 
function / = f(xi, . . . ,x m -i, 0) is also in T. To see this, consider / G T and A : Fq — > Fq 
which is not a permutation. Then there exists affine permutations B,C : Fq — > Fq such that 
A(x) = B(C(x)i, . . . , C(x) r , 0, . . . , 0) where r < m is the dimension of the image of A. By closure 
under affine permutations, it follows / o C £ J. Applying the assertion above m — r times we have 



18 



that /'(x) = / o C(x\, . . . , x r ,0, • • • , 0) is also in T . Finally / o A = f o B is also in J 7 . So we turn 
to proving that for every / G J 7 the function / = f{x\, . . . , x m -\, 0) is also in J 7 . 

Let /(x) = ^ d CdX d . Notice /(x) = XZd|d m =o c d xd - Writing / = / + /i, we use Lemma [A.2I to 
split / till we express it as a sum of basic functions / = Y2i=i ^i, where each 6j is a basic function 
in J 7 . Note that for every 6j, we have supp(6j) C supp(/) or supp(6j) C supp(/i) (since the trace 
preserves d m = 0). By reordering the 6j's assume the first M fej's have their support in the support 
of /. Then we have / = Xw=i h £ J 7 . □ 

We thus turn to the proof of Lemma I A. 2 1 We prove the lemma in a sequence of cases, based 
on the kind of monomials that / has in its support. 

We say that d and e are equivalent (modulo q) , denoted d = q e if there exists a j such that for 
every i, di = q J ei (mod* Q). The following proposition is immediate from previous works (see, for 
example, [6]). We include a proof for completeness. 

Proposition A. 3. If every pair d,e in the support of f : Fq — > ¥ q are equivalent, then f is a basic 
function. 

Proof. We first note that since the Tr : Fq — > ¥ q is a (Q/q)-to-one function, we have in particular 
that for every (3 £ ¥ q there is an a £ Fq such that Tr(a) = f3. As an immediate consequence we 
have that every function / : Fq — > ¥ q can be expressed Trog where g : Fq — > Fq. Finally we note 
that we can view g as an element of Fq[x], to conclude that / = Tr og for some polynomial g. 

Now fix / : Fq — > ¥ q all of whose monomials are equivalent. By the above we can express 
/ = Trog for some polynomial g. By inspection we can conclude that all monomials in the 
support of g are equivalent to the monomials in the support of /. Finally, using the fact that 
Tr(ax d ) = Tr(a^ qd ( mod *Q)) we can assume that g is supported on a single monomial and so 
/ = Tr(Ax d for some A G Fq. □ 

So it suffices to show that every function that contains non-equivalent degrees in its support 
can be split. We first prove that functions with "non- weakly-equivalent" monomials can be split. 

We say that d and e are weakly equivalent if there exists a j such that for every i, di = q 3 ei{ 
mod Q — 1). 

Lemma A. 4. If J- C {Fq —> ¥ q } is an ¥ q -linear code invariant under affine permutations and 
f £ J- contains a pair of non-weakly equivalent monomials in its support, then f can be split. 

Proof. Let d and e be two non weakly-equivalent monomials in the support of /. Fix j and consider 
the function fj(x) = J2ae(¥* Q ) m Tl 'h^^ fi a i x ii ■ ■ ■ > a mX m ). We claim that (1) the support of fj is 
a subset of the support of /, (2) g J d is in the support of fj, (3) f is in the support of fj only if for 
every i fi = q J di{ mod Q — 1) and in particular (4) e is not in the support of fj. 

Now let b = 6(d) be the smallest positive integer such that q b di = di (mod* Q) for every i. Now 
consider the function g = X^=o fj ■ ^ e nave that g G T since it is an F^-linear combination of 
linear transforms of functions in T . By the claims about the fj's we also have that d is in the 
support of g, the support of g is contained in the support of / and e is not in the support of /. 
Expressing / = g + (/ — g) we now have that / can be split. □ 

The remaining cases are those where some of coordinates of d are zero or Q — 1 for every d in 
the support of /. We deal with a special case of such functions next. 
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Lemma A. 5. Let F be a linear affine-invariant code. Let f G T be given by /(x, y) = Tr(y d p(x)) 
where every variable in p(x) /ms degree in {0, Q — 1} in every monomial, and d is arbitrary. 
Further, let degree of p(x) be a(Q — 1). TTien /or every < b < a and for every A G Fq, £/ie 
function (xf- x b ) Q ~ l Tr(Ay d ) G J". 

Note that in particular the lemma above implies that such /'s can be split into basic functions. 

Proof. We prove the lemma by a triple induction, first on a, then on b, and then on the number of 
monomials in p. The base case is a = and that is trivial. So we consider general a > 0. 

First we consider the case b < a. Assume w.l.o.g. that the monomial {x\ ■ ■ ■x a ) < ^~ 1 is in the 
support of p and write p = po + x®~ l p\ where po,pi do not depend on x\. Note that pi ^ and 
deg(pi) = (a — 1)(Q — 1). We will prove that — Tr(y d pi(x)) G J 7 and this will enable us to apply the 
inductive hypothesis to pi. Let g(x, y) = J2/3e¥ Q f( x i + fit x 2-> • • • > x m, y)- By construction g € J 7 . 
By linearity of the Trace we have 

Po + (si + /?) Q ~V I I = Tr(y d (- Pl (x))), 

P&Q J ) 

where the second equality follows from the fact that X^£Fq( z + = —1. Thus we can now 

use induction to claim (x\ . . . xj,)^" 1 Tr(Ay d ) G T . 

Finally we consider the case b = a. Now note that since the case b < a is known, we can assume 
w.l.o.g that p is homogenous (else we can subtract off the lower degree terms). Now if a = m there 
is nothing to be proved since p is just a single monomial. So assume a < m. Also if p has only one 
monomial then there is nothing to be proved, so assume p has at least two monomials. In particular 
assume p is supported on some monomial that depends on x± and some monomial that does not 
depend on x\. Furthermore, assume w.l.o.g. that a monomial depending on x\ does not depend on 
%%. Write p = xf~ 1 p\ +x ( ^~ 1 p2 + {xiX2)®~ l pz +Pa where the p^s don't depend on x\ or xi- By as- 
sumption on the monomials of p we have that p\ ^ and at least one of P2,P3iP4 7^ 0. Now consider 
the affine transform A that sends x\ to x\ + xi and preserves all other x^s. We have g = f o A = 
Tr (y d (x®~ l pi + x®~~ l (p\ + P2) + (^i^2)'^~ 1 P3 + Pi + where the xi-degree of every monomial 
in r is in {1, . . . , Q — 2}. Now consider </(x, y) = X^«gf* 9( ax i-> x 2, ■ ■ ■ > x m, y)- The terms of r 
vanish in g' leaving g' = — (f o A — r) = Tr ( y d ( — xf~ 1 p\ — x®~ l (jp\ + p%) — (xiX2)^ _1 p3 — Pa 




Finally we consider the function g = f + g' = Tr(y d (— x®~ l pi)) which is a function in T of de- 
gree a(Q — 1) supported on a smaller number of monomials than /, so by applying the inductive 
hypothesis to g we have that J- contains the monomial (x\ • • • x a )^ _1 . □ 

The following lemma converts the above into the final piece needed to prove Lemma lA. 21 

Lemma A. 6. If T C {Fq — > ¥ q } is an ¥ q -linear code invariant under affine permutations and all 
monomials in f G T are weakly equivalent, then f can be split. 

Proof. First we describe the structure of a function / : Fq — > ¥ q that consists only of weakly 
equivalent monomials. First we note that the m variables can be separated into those in which 
every monomial has degree in {1, ... ,Q — 2} and those in which every monomial has degree in 
{0, Q — 1} (since every monomial is weakly equivalent). Let us denote by x the variables in which 
the monomials of / have degree in {0, Q — 1} and y be the remaining monomials. Now consider 
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some monomial of the form M = cx e y d in /. Since / maps to ¥ q we must have that the coefficient 
of (x e y d ) 9J is c qJ . Furthermore, we have every other monomial M' in the support of / is of the 
form c'y^x 6 • Thus / can be written as Tr(y d p(x)) where p{x\, . . . ,x m ) = p(x®~ 1 , . . . ,Xm~ l ). 
But, by Lemma |A.5| such an / can be split. □ 

Proof of Lemma \A.S\, If / contains a pair of non- weakly equivalent monomials then / can be split 
by Lemma lA.41 If not, then / is either basic or, by Lemma IA.6I is can be split. □ 

We also prove an easy consequence of Lemma IA.2I 

Lemma A. 7. Let F C {Fq — > ¥ q } be affine invariant. If d G Degp 7 ), then Tr(Ax d ) G F for all 
AgFq. 

Proof. We first claim that Lemma IA.2I implies that there exists /3 G Fq such that Tr(/3x d ) is a non- 
zero function in T . To verify this, consider a "minimal" function (supported on fewest monomials) 
/ G J- with d G supp(/). Since / can't be split in J- (by minimality), by Lemma lA.21 / must be 
basic and so equals (by definition of being basic) Tr(/3x d ). 

Now let b = 6(d) be the smallest positive integer such that q b d (mod* Q) = d. If Q = q n , note 
that b divides n and so one can write Tr : Fq — > ¥ q as Tri o Tr2 where Tri : F i — > ¥ q is the function 
Tn(z) = z + z q + • • • + zi"" 1 and Tr 2 : F Q ->■ ¥ qb is the function Tr 2 (z) = z + z^" + ■ ■ ■ + z Q l qb . 
(Both Tri and Tr2 are trace functions mapping the domain to the range.) It follows that Tr(/3x d ) = 
TrxCTr^^). 

We first claim that Tri(rx d ) G T for every r G F 9 &. Let S = {^«g(F^) m a a ■ a d \ a a £ ¥ q }. 

We note that by linearity and affme-invariance of J 7 , we have that Tri(Tr 2 (/3) • r/x d ) G T for every 
rj G S. By definition S is closed under addition and multiplication and so is a subfield of Fq. In 
fact, since every rj G S satisfies rj q = r\ (which follows from the fact that a d = a q d ), we have that 
S C ¥ q b. It remains to show S = F i. Suppose it is a strict subfield of size q c for c < b. Consider 
ry d i for *y g Fq and i G [m]. Since j di G S, we have that ^ diqC = ~/ di for every 7 G Fq and so we 
get x\ d * = Xi mod (xf — xi). We conclude that x 9Cd = x d (mod x^ — x) which contradicts the 
minimality of b = 6(d). We conclude that S = ¥ q b. Since Tr2(/3) G F* 6 , we conclude that the set of 

coefficients r such that Tri(rx d ) G J- is all of F & as desired. 

Finally consider any A G Fq. since Tr 2 (A) G ¥ q b, we have that Tri(Tr 2 (A)x d ) G J- (from the 
previous paragraph), and so Tr(Ax d ) = Tri(Tr 2 (A)x d ) G T □ 

B Coefficients of multinomial expansions modulo a prime 

For an integer d, let d^' denote the ith. digit in the p-ary expansion of d, so that d = d^p 1 . 
Let = p denote equivalence modulo p. The following is a well known theorem of Lucas. 

Theorem B.l (Lucas' theorem). Ifd,e€ Z, then (fj = p fTj (*)• 

In particular, (^) ^ (mod p) if and only if e < p d, so we have {x + y) d = p X^ e < p d x e y d ~ e . More 

generally, we would like to know when (^) vanishes modulo p. To this end, we use the following 
claim. 
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Lemma B.2. If d G Z and e £ Z', then ( e ) ^ p on/y i/e < p d. More generally, if d G Z m and 
E G Z mx *, i/ien ( d ) ^ p on/n i/E < p d. 

Proof. We have (^) = Hi=i ( d ^ =l6j )- For this to be nonzero modulo p, by Lucas' theorem we 
have ej < p d — X^'=i e j > fr° m which it follows that e < p d. The more general statement then follows 
immediately from definition. □ 

Lemma B.3. Let A G Z mxi and let d,b G Z m and x G Z*. TTiera 

(Ax + b) d = £ [^A E x s ( E )b d - s ( ET ). 

E 

Proof. For matrices A, E, let aij,eij denote their entries respectively. The lemma follows by 
straightforward calculation. We have 



m I t 



(Ax + b) d = n(£ a ^ + bi 

»=i \j=i 



•••••« ' u 

m t \ I t 



E \i=lj=l / \j=l 

= E(E) AExS(E)bd ~ E(ET) - 



□ 
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